Picture for Luyao Niu

Luyao Niu

JobBench: Aligning Agent Work With Human Will

Add code
May 25, 2026
Viaarxiv icon

Polyhedral Instability Governs Regret in Online Learning

Add code
May 13, 2026
Viaarxiv icon

The WidthWall: A Strict Expressivity Hierarchy for Hypergraph Neural Networks

Add code
May 13, 2026
Viaarxiv icon

Visual Aesthetic Benchmark: Can Frontier Models Judge Beauty?

Add code
May 12, 2026
Viaarxiv icon

ST-ProC: A Graph-Prototypical Framework for Robust Semi-Supervised Travel Mode Identification

Add code
Nov 17, 2025
Viaarxiv icon

Event-CausNet: Unlocking Causal Knowledge from Text with Large Language Models for Reliable Spatio-Temporal Forecasting

Add code
Nov 16, 2025
Viaarxiv icon

VisualSphinx: Large-Scale Synthetic Vision Logic Puzzles for RL

Add code
May 29, 2025
Viaarxiv icon

SOSBENCH: Benchmarking Safety Alignment on Scientific Knowledge

Add code
May 27, 2025
Viaarxiv icon

Temporal Sampling for Forgotten Reasoning in LLMs

Add code
May 26, 2025
Viaarxiv icon

TinyV: Reducing False Negatives in Verification Improves RL for LLM Reasoning

Add code
May 20, 2025
Figure 1 for TinyV: Reducing False Negatives in Verification Improves RL for LLM Reasoning
Figure 2 for TinyV: Reducing False Negatives in Verification Improves RL for LLM Reasoning
Figure 3 for TinyV: Reducing False Negatives in Verification Improves RL for LLM Reasoning
Figure 4 for TinyV: Reducing False Negatives in Verification Improves RL for LLM Reasoning
Viaarxiv icon